An Efficient Data Replication Scheme for Hadoop Distributed File System
نویسندگان
چکیده
منابع مشابه
Efficient Data Replication Scheme based on Hadoop Distributed File System
Hadoop distributed file system (HDFS) is designed to store huge data set reliably, has been widely used for processing massive-scale data in parallel. In HDFS, the data locality problem is one of critical problem that causes the performance decrement of a file system. To solve the data locality problem, we propose an efficient data replication scheme based on access count prediction in a Hadoop...
متن کاملDelay Scheduling Based Replication Scheme for Hadoop Distributed File System
The data generated and processed by modern computing systems burgeon rapidly. MapReduce is an important programming model for large scale data intensive applications. Hadoop is a popular open source implementation of MapReduce and Google File System (GFS). The scalability and fault-tolerance feature of Hadoop makes it as a standard for BigData processing. Hadoop uses Hadoop Distributed File Sys...
متن کاملQoS-Aware Data Replication in Hadoop Distributed File System
Dr. Sunita Varma Department of ComputerTechnology and Application S. G. S. I. T. S. Indore, (M. P.), India [email protected] Ms. Gopi Khatri Department of Computer Engineering S. G. S. I. T. S Indore, (M. P.), India [email protected] --------------------------------------------------------------------ABSTRACT------------------------------------------------------------Cloud computin...
متن کاملBig Data Analytics: An Approach using Hadoop Distributed File System
Today’s world is driven by Growth and Innovation for a better future. All of which are based on analysis and harnessing of tons of data, typically known as Big Data. The tasks involved for achieving results at such a scale can be challenging and painfully slow. This paper works towards an approach for effectively solving a large and computationally intensive problem by leveraging the capabiliti...
متن کاملGoogle File System and Hadoop Distributed File System - An Analogy
Big Data has indeed been the word which IT Industry is talking about lately. With advancement of automation and data being processed in real time, it has now become a necessity for companies to look forward to sustainable solutions to store their huge datasets and compute valuable information out of it. High performance computing heavily relies on distributed environments to process large chunk...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Engineering & Technology
سال: 2018
ISSN: 2227-524X
DOI: 10.14419/ijet.v7i2.32.15396